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^ ' Abstract 

Given polynomials f(x),gi{x),hj{x), wc study how to minimize f{x) on the set 

S = {xe K" : hi{x) = ■■■ = h„,,{x) = 0,gi{x) > 0, . . .,g„^,{x) > 0} . 

. Let fmin be the minimum of / on S. Suppose S is nonsingular and fmin is achievable on 

I S, which is true generically. The paper proposes a new semidefinite programming (SDP) 

• . relaxation for this problem. First we construct a set of new polynomials ipi{x), . . . , ipr{x), 

' by using the Jacobian of /, hi , gj , such that the above problem is equivalent to 

min f(x) 

s.t. hi{x) ~ 0, ^^j{x) = 0, 1 < i < "nil, 1 < j < r, 
51 {xY^ ■■■gm, > 0, Vt. e {0, 1}"= . 

> 

QQ ' Then we prove that for all N big enough, the standard iV-th order Lasserre's SDP relax- 

ation is exact for solving this equivalent problem, that is, it returns a lower bound that 

\l ' is equal to fmin- Some variations and examples are also shown. 

CN 

\Q • Key words determinantal varieties, ideals, minors, polynomials, nonsingularity, semidefi- 

, nite programming, sum of squares 
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■ 1 Introduction 



Consider the optimization problem 

min fix) 

S.t. hi{x) = ■ ■ ■ = hrmix) = (1-1) 
9i{x) >0,...,gm2{x)>0 

where f (x) , gi{x) , hj (x) are polynomial functions in x G M". Let 5 be its feasible set and 
fmin be its global minimum. We are interested in finding fmin- This problem is NP-hard, 
even when one of the polynomials is quadratic. 
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A standard approach for solving (jl.ip is semidefinite programming (SDP) relaxations pro- 
posed by Lasserre jH] . It is based on a sequence of sum of squares (SOS) type representations 
of polynomials that are nonnegative on S. The basic idea is, for a given integer A'^ > (called 
relaxation order), solve the SOS program 

max 7 

mi 7712 

S.t. fix) - 7 = YlMx)hi{x) + T.'^jix)9j{x), 

i=i j=i (1-2) 

deg{cl)ihi), degiajgj) <2N 

o"i, . . . , (7^2 are SOS. 

In the above, go{x) = 1, the decision variables are the coefficients of polynomials (pi and aj. 
Here a polynomial is SOS if it is a sum of squares of other polynomials. The SOS program 
p.2p is equivalent to an SDP problem (see [S]). We refer to |2H [22] for more about the 
connection between SDP and SOS programs. Let pi\f be the optimal value of ()1.2p . Clearly, 
Pn < fmin for every N. Using Putinar's Positivstellensatz [23|, Lasserre proved pN fmin 
as — )• oo, under the archimedean condition. A stronger relaxation than (II. ip would be 
obtained by using cross products of gj, which is 

max 7 

s.t. /(x)-7= Yl cl)i{x)hi{x) + CTuix) ■ giy{x), 

i=l,...,mi i^GlO.lj^a (1.3) 

degi^ihi) < 2N, degia^g^) < 2N Vz, u, 
Gy are all SOS. 

Here, denote gu = Si^ ■ ■ ■ Qni-^ ■ Let be the optimal value of (|1.3p . When S is compact, 
Lasserre showed — )• /mm as A^ goes to infinity, using Schmiigen's Positivstellensatz [26]. 
An analysis for the convergence speed oipN-,qN to fmin is given in [191 [27j. Typically, (jl.2p 
and (|1.3p are not exact for (11. ip with a finite A^. Scheiderer [25] proved a very surprising 
result: whenever S has dimension three or higher, there always exists / such that f{x) — fmin 
does not have a representation required in (jl.3p . Thus, we usually need solve a big number 
of SDPs until convergence is met. This is very inefficient in many applications. Furthermore, 
when S is not compact, typically we do not have the convergence oipN,qN to fmin- This is 
another difficulty. Thus, people are interested in more efficient methods for solving (jl.ip . 

Recently, the author, Demmel and Sturmfels [18] proposed a gradient type SOS relaxation. 
Consider the case of (|l.ip without constraints. If the minimum fmin is achieved at a point 
u, then V/(u) = 0, and the problem is equivalent to 

min f(x) s.t. ^ = ... = ^=0. (1.4) 

xeM" OXi OXn 



In |18] . Lasserre's relaxation is applied to solve ()1.4p . It was shown in [18] that a sequence 
of lower bounds converging to fmin would be obtained, and it has finite convergence if the 
gradient ideal, generated by the partial derivatives of /(x), is radical. More recently, Demmel, 
the author and Powers [7] generalized the gradient SOS relaxation to solve (|l.ip by using the 
Karush-Kuhn- Tucker (KKT) conditions of (jl.ip 

mi 7712 

V/(2;) = ^KVhi{x) + ^iijVgj{x), fJ-jgjix) = 0, j = 1, . . . , m2. 
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If a global minimizer of (jl.ip is a KKT point, then (jl.ip is equivalent to 



min f{x) 

X,\,fl 

s.t. = • • • = = 0, 

mi m2 (1.5) 

i=i j=i 
fJ-jOjix) = 0, 5fj(x) > 0, j = 1, . . . ,m2. 

Let {vjy} be the sequence of lower bounds for (jl.Sp obtained by applying Lasserre's relax- 
ation of type (II. 3p . It was shown in [7] that V]\f — )• /mm, no matter S is compact or not. 
Furthermore, it holds that vn = fmin for a finite when the KKT ideal is radical, but it 
was unknown in [7] whether this property still holds without the KKT ideal being radical. A 
drawback for this approach is that the involved polynomials are in (x, A, /x). There are totally 
n + mi + m2 variables, which makes the resulting SDP very difficult to solve in practice. 

Contributions This paper proposes a new SDP type relaxation for solving (jl.ip using KKT 
conditions but the involved polynomials are only in x. Suppose S is nonsingular and fmin is 
achievable, which is true generically. We construct a set of new polynomials ipi{x), . . . , ^Pr{x), 
by using the minors of the Jacobian of /, hi,gj, such that (jl.ip is equivalent to 

min fix) 

s.t. hi{x) = ifj{x) = 0, I < i < nil, 1 < j < r, 
91 {xY^ • • • {xY"^^ > 0, V 1/ G {0, 1}'"2 . 

Then we prove that for all big enough, the standard A^-th order Lasserre's relaxation for 
the above returns a lower bound that is equal to the minimum fmin' That is, an exact SDP 
relaxation for (jl.ip is obtained by using the Jacobian. 

This paper is organized as follows. Section [2] gives the construction of this exact SDP 
relaxation by using Jacobian. Its exactness and genericity are proved in Section [3l Some 
efficient variations are proposed in Section 21 Some examples of how to apply this exact SDP 
relaxation are shown in Section [5l Some conclusions and discussions are made in Section [6l 
Finally, we attach an appendix introducing some basics of algebraic geometry and real algebra 
that are used in the paper. 

Notations The symbol N (resp., M, C) denotes the set of nonnegative integers (resp., real 
numbers, complex numbers). For any t G M, \t\ denotes the smallest integer not smaller than 
t. For integer n > 0, [n] denotes the set {1, . . . ,n}, and [n]k denotes the set of subsets of [n] 
whose cardinality is k. For a subset J of [n], \J\ denotes its cardinality. For x € M", Xj denotes 
the i-th component of x, that is, x = (xi, . . . , x„). For a G N", denote \a\ = ai + ■ ■ ■ + an- 
For X G M" and a G N", x" denotes x°^ • • -x^". The symbol M[x] = ]R[xi, . . . ,x„] (resp. 
C[x] = C[xi, . . . , x„]) denotes the ring of polynomials in (xi, . . . , Xn) with real (resp. complex) 
coefficients. A polynomial is called a form if it is homogeneous. The M[x]<rf denotes the 
subspace of polynomials in M[x] of degrees at most d. For a general set T C M", int{T) 
denotes its interior, and dT denotes its boundary in standard Euclidean topology. For a 
symmetric matrix X, X ^ (resp., X y 0) means X is positive semidefinite (resp. positive 
definite). For u G M.^ , \\u\\2 denotes the standard Euclidean norm. 
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2 Construction of the exact Jacobian SDP relaxation 

Let S be the feasible set of (II. ip and 

m = min(mi + m2, n — 1). (2-1) 

For convenience, we denote h{x) = {hi{x), . . . ,hmi{x)) and g{x) = {gi{x), . . . ,gm2{x))- For 
a subset J = {ji, ■ ■ ■ ,jk} C [m2], dennote 

gji^) = idjiix), - ■ ■ ,gjkix)). 

Let X* be a minimizer of (ll.ip . If J = {ji, . . . , j^} is the index set of gj{x*) = and the 
KKT conditions hold at x* , then there exist Aj and € J) such that 

h{x*)=0, gj{x*) = 0, V/(x*)= ^ A,V/i,(x*) + J^^jV5j(x*). 

ie[mi] jeJ 

The above implies the Jacobian matrix of (/, h,gj) is singular at x* . For a subset J C [m2], 
denote the determinantal variety of {f,h,gj)'s Jacobian being singular by 

Gj = {x G C" : rank5^(2;) < mi + |J|} , = [V/(x) V/i(x) Vgj{x)]. (2.2) 

Then, x* G V{h, gj) n Gj where V{h, gj) := {x G C" : /i(x) = 0, gj{x) = 0} . 

This motivates us to use gj{x) = and Gj to get tighter SDP relaxations for (II. ip . To 
do so, a practical issue is how to get a "nice" description for Gj? An obvious description 
for Gj is that all its maximal minors vanish. But there are totally (^^^^^i) such minors (if 
mi + A; + 1 < n), which is huge for big n, mi, k. Can we define Gj by a set of the smallest 
number of equations? Furthermore, the active index set J is usually unknown in advance. 
Can we get an SDP relaxation that is independent of J? These issues will be discussed in 
the sequel. 

2.1 Minimum defining equations for determinantal varieties 

Let k < n and X = (Xij) he a n x k matrix of indeterminants Xij. Define the determinantal 
variety 

D^^l^ = |X G C"^^' : rankX < t} . 

For any index set / = {zi, . . . , i^} C [n], denote by det/(X) the (zi, . . . , ifc) x (1, . . . , A;)-minor 
of matrix X, i.e., the determinant of the submatrix of X whose row indices are ii, . . . ,ik and 
column indices are 1, . . . ,k. Clearly, it holds that 

Dl'\ = [X e C"^*^ : detiiX) =0 V / G [n]k] . 

The above has (^) defining equations of degree k. An interesting fact is that we do not need 
(fc) equations to define D^'\- Actually nk — k"^ + 1 are enough. There is very nice work 
on this issue. Bruns and Vetter [3] showed nk — + 1 equations are enough for defining 
Z?"_lp Later, Bruns and Schwanzl [2] showed nk — t^ + 1 is the smallest number of equations 
for defining D^^ri- Typically, nk - + I < {fj for big n and k. A general method for 
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constructing nk — + 1 defining polynomial equations for D^^^ was described in Chapt. 5 
of [3]. Here we briefly show how it works for D^'^-^. 

Let r(X) denote the set of all /c-minors of X (assume their row indices are strictly 
increasing). For convenience, for any 1 < ii < • • • < < n, we just denote by [ii, . . . , i^] the 
{ii, . . . X (1, . . . , /c)-minor of X. Define a partial ordering on T{X) as follows: 

k k 

[zi, . . . < [ji, . . . , jfe] n < ji, • • • < ifc, < ^if- 

1=1 1=1 

If / = {ii, . . . , ifc}, we also write / = [ii, . . . , ifc] as a minor in T{X) for convenience. For any 
/ G V[X), define its rank as 

rA;(/) = max 1^ : / = > • • • > every /^^^ G r(X)| . 

The maximum minor in T{X) is [n — + 1, . . . , n] and has rank nk — k"^ + 1. For every 
1 < £ < n/c - ^2 + 1, define 

m{X) = J2 det,(X). (2.3) 

7e[n]fe,rfc(/)=^ 

Lemma 2.1 (Lemma (5.9), Bruns and Vetter [3]). It holds that 

Dl^^^ = {x G C"^'= : i]i{X) = 0, £ = 1, . . . ,nfc - A:^ + l} . 

When k = 2, D"'^ would be defined by 2n — 3 polynomials. The biggest minor is [n — 1, n] 
and has rank 2n — 3. For each £ = 1, 2, . . . , 2n — 3, we clearly have 

l<il<i2<n: ii+i2=i+2 

Every 2-minor of X is a summand of some r]i{X). 

When A; = 3, D2'^ can be defined by 3n — 8 polynomials of the form rii{X). For instance 
of n = 6, the partial ordering on T{X) is shown in the following diagram. 



125 ^ 126 ^ 136 ^ 146 ^ 156 ^ 256 



123 




456 



In the above an arrow points to a bigger minor. Clearly, we have the expressions 

?7i(X) = [1,2,3], ??2(X) = [1,2,4], 7?3(X) = [1,2,5] + [1,3,4], 

r/4(X) = [1,2,6] + [1,3,5] + [2,3,4], i^^{X) = [1,3,6] + [1,4,5] + [2,3,5], 
%(X) = [1, 4, 6] + [2, 3, 6] + [2, 4, 5], in{X) = [1, 5, 6] + [2, 4, 6] + [3, 4, 5], 
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778 (X) = [2, 5, 6] + [3, 4, 6], r?9(X) = [3, 5, 6], r?io(X) = [4, 5, 6]. 

Every above r]i{X) has degree 3. Note that the summands [ii, 12,13] from the same r]i{X) 
have a constant summation ii + 12 + 13- Thus, for each ^ = 1, . . . , 3n — 8, we have 

m{X)= ^ [ii,i2,k]- 

I<ii<i2<i3<n: ii+i2+i3=£+5 

When /c > 3 is general, D^'^-^ can be defined hy nk — k"^ + 1 polynomials of the form rji^X). 
For each £ = 1,2, ... ,nk — k^ + 1, we similarly have the expression 

r]i{X) = ^ 

l<il<-<ik<n:ii+-+ik=i+{''+^)-l 

2.2 The exact Jacobian SDP relaxation 

For every J = {ji, . . . , j^} C [777-2] with k < m — mi, by applying formula (|2.3p . let 

r](, . . . , rii^n{J) where len{J) = 71(7711 + A; + 1) — {mi + k + lf + 1 

be the set of defining polynomials for the determinantal variety Gj defined in (I2.2p of the 
Jacobian of (/, h, gj) being singular. For each i = 1,. . . , len{J), define 

= ■ JI gj{x), where ^ = [m2]\J. (2.4) 

Using the product Ylj^jc gj{x) in the above is motivated by a characterization of critical 
points in [151 . For simplicity, list all possible polynomials v?/(x) in (|2.4p sequentially as 

ipi{x), ip2{x), . . . , ipr{x), where r= len{J). (2.5) 

.} Cl[m2\,\J\<m—mi 

Now wefine the variety 

W = {x(^C : hi{x) = ■ ■ ■ = hm^ {x) = ipi{x) = --- = ipr{x) = 0} . (2.6) 
If the minimum fmin of p.ip is achieved at a KKT point, then (jl.ip is equivalent to 

min fix) 

S.t. /li(x) = • • • = /imi(x) = 0, (-27) 

ip^[x) = ■■■ = ip,.{x) = 0, 
(7.(x) >0, Vt^G {0,1}™^. 

Here, we denote gu{x) = gi{xY^ ■ ■ ■ ^m.j (a^)'^™^ . 

To construct an SDP relaxation for ()2.7p . we need to define localizing moment matrices. 
Let q{x) be a polynomial with deg{q) < 2N. Define symmetric matrices ^4^^^ such that 

Q(x)[x]rf[x]J = Yl d = N-\deg{q)/2]. 

a£N":\a\<2N 
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Then the A^-th order locahzing moment matrix of q is defined as 

a6N":|a|<2Ar 

Here y is a moment vector indexed by a G N" with |q| < 2N . Moreover, denote 

aeN":|Q!|<deg(/) aeN":|a|<deg(/) 



The A^-th order Lasserre's relaxation for ()2.7p is the SDP 

In^ ■■= min Lf{y) 

s-t- -^if^(y) = 0, « = 1,. . . 
4^'^(y) = o, i = i,...,r, 

4f^(y)>=0, Vz. G {0, lr^ yo = 1. 



(2. 



Compared to Schmiidgen type Lasserre's relaxation, by (j2.5p . the number of new equations 
in (|2.8p is r = 0^2"*^ • n • (mi + m2)^ . That is, r is of linear order in nmi for fixed m2, but 
is exponential in m2- So, when m2 is small or moderately large, (j2.8p is practical; but for 
big 1712, (|2.8p becomes more difficult to solve numerically. Now we present the dual of (|2.8p . 
Define the truncated preordering p(^) generated by gj as 

^'('^^ = I a,(x)g,(x) : deg(cT,5.) < 2iV I , (2.9) 

and the truncated ideal J^^-* generated by hi and (/J,- as 



mi 



Then, as shown in Lasserre |16j . the dual of ()2.8p is the following SOS relaxation for ()2.7p : 



(2) 

s.t. /(^)_^e/W + pW. ^^-^'^ 



/j^' := max 7 



Note the relaxation (j2.1ip is stronger than (jl.3p . Let /* be the optimal value of (j2.7p . Then, 
by weak duality, we have the relation 

fP < fS^ < r. (2.12) 

We are going to show that when is big enough, (|2.8p is an exact SDP relaxation for 

,(2) _ ,(1) 
N ~ JN 



(j2.7p . i.e., /l?^ = f^^ = f*. For this purpose, we need the following assumption. 



Assumption 2.2. (i) nii < n. (ii) For any u £ S, at most n — mi of gi{u), . . . , gm2{u) 
vanish, (in) For every J = {ji, ■ ■ ■ , jk} C [1712] with k < n — mi, the variety V{h,gj) = 
{x € C"" : h{x) = 0, gj{x) = 0} is nonsingular (its Jacobian has full rank on V{h,gj)). 
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Theorem 2.3. Suppose Assumption \2.S\ holds. Let f* be the minimum of \2. 7p . Then there 
exists an integer N* > such that f^^ = = /* for all N > N* . Furthermore, if the 
minimum fmin of lil.l]) is achievable, then f^^ = f^^ = fmin for N > N* . 

Theorem 1 2 . 3 1 will be proved in Section [3l When the feasible set S of (jl.ip is compact, the 
minimum fmin is always achievable. Thus, Theorem 12.31 implies the following. 

Corollary 2.4. Suppose Assumption \2.S\ holds. If S is compact, then f^^ = f^^ = fmin for 
N big enough. 

A practical issue in applications is how to identify whether (12. Sp is exact for a given 
N. This would be possible by applying the flat-extension condition (FEC) [B]. Let y* be 

a minimizer of (j2.8p . We say y* satisfies FEC if rankLg^''(y*) = rankLg^ ^\y*) for every 
u. When FEC holds, (|2.8p is exact for p.ip . and a finite set of global minimizers would be 
extracted from y*. We refer to [13] for a numerical method on how to do this. A very nice 
software for solving SDP relaxations from polynomial optimization is GloptiPoly 3 |l4j which 
also provides routines for finding minimizers if FEC holds. 

Now we discuss how general the conditions of Theorem 12.31 are. Define 
Bd{S) = {/ G M[x]<rf : inf f{u) > -ooj . 
Clearly, Bd{S) is convex and has nonempty interior. Define the projectivization of S as 

SP^^ = [ieR^'^^ : hi{x) = --- = hmAi) = 0, h{S:)>0,...,gmM>0} ■ (2-13) 

Herep denotes the homogenization of p, and x = {xq,xi, . . . ,x„), i.e., p{x) = x'^^^^^ p{x / xq) . 
We say S is closed at oo if 

SP''^ n {XQ > 0} = closure {S'P''^ n {xo > 0}) . 

Under some generic conditions. Assumption 12.21 is true and the minimum fmin of (II. ip is 
achievable. These conditions are expressed as non-vanishing of the so-called resultants Res 
or discriminants A, which are polynomial in the coefficients of /, hi,gj. We refer to Appendix 
for a short introduction about Res and A. 

Theorem 2.5. Let /, hi,gj be the polynomials in and S be the feasible set. 

(a) If mi > n and Res{hi-^ , ■ ■ ■ , hi^+i) 7^ for some {ii, . . . , in+i}, then 5 = 0. 

(b) If mi < n and for every {ji, . . . ,jn-mi+i} C [771,2] 

Res{hi, . . . ,Kn,gj^, ■ ■ ■ ,gj„_^^+^) / 0, 
then item (ii) of Assumption W^ holds. 

(c) If mi < n and for every {ji, . . . , j^} C [777,2] with k < n — mi 

A{hi,...,hmi_,gji,...,gj^) / 0, 
then item (Hi) of Assumption W^ holds. 
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(d) Suppose S is closed at oo and f G Bd{S). If the resultant of any n of h^°^ , 
nonzero (only when mi + m2 ^ n), and for every {ji, . . . with k < n — mi — 1 

\ ( fhom vhom vhom hom hom\ _/_ rt 

i"-! T--i'>'mi 'yji '■■■'yjfe > T U, 

then there exists v ^ S such that fmin = f{v). Here p^°"^ denotes p's homogeneous part 
of the highest degree. 

(e) If f € Bd{W) and A(/''°™) / 0, then the minimum of f{x) in M" is achievable. 

Theorem 12.51 will be proved in Section [3l Now we consider the special case of (jl.ip having 
no constraints. If fmin > —oo is achievable, then (jl.ip is equivalent to (jl.4p . The item 
(e) of Theorem 12.51 tells us that this is generically true. The gradient SOS relaxation for 
p.4p described in [18] is the same as (j2.1ip for the unconstrained case. The following is an 
immediate consequence of Theorem 12.31 and item (e) of Theorem 12.51 

Corollary 2.6. If S = M", f{x) has minimum 

frrnn > -OO, and A(/'^°"^) 7^ 0, then the 
optimal values of \2. 9^) and \2.11]) are equal to fmin if ^ is big enough. 

Corollary 12.61 is stronger than Theorem 10 of [18], where the exactness of gradient SOS 
relaxation for a finite order N is only shown when the gradient ideal is radical. 



3 Proof of exactness and genericity 

This section proves Theorems 12.31 and 12.51 First, we give some lemmas that are crucially used 
in the proof. 

Lemma 3.1. Let K be the variety defined by the KKT conditions 



K 



hi{x) = fijgjix) = 0, V(i,/) G [mi] x [m2] 

If Assumption holds, then W = where 

Kx = {x G C" : (x, X, fi) £ K for some A, fi}. 

Proof. First, we prove W C K^. Choose an arbitrary u G W. Let J = {j £ [ni2] ■ gjiu) = 0} 
and k = \J\. By Assumption 12.21 mi + k < n. Recall from (12. 2p that 

B^{x) = [Vf{x) Vh{x) Vgj{x)] . 

Case mi + k = n By Assumption 12. 2^ the matrix II{u) = [V/i(u) Vgj{u)~^ is nonsingular. 
Note that II{u) is now a square matrix. So, II{u) is invertible, and there exist Aj and 
G J) such that 

■mi 

Vf{u) = Y,^i'^Hu) +Y,H'^gj{u) (3.2) 

i=i j&J 

Define /ij = for j J, then we have u G Kx. 
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Case mi -\- k < n By the construction of polynomials (pi{x) in (j2.5p . some of them are 
^^ix) := rii{B-^{x))- JJ 9j{x), i = I, . . . ,n{mi + k + I) - {rm + k + if + 1. 

3 a J" 

So the equations ipi{u) = imply every above (pj [u) = (see its definition in (12. 4p ). Hence 
{u) is singular. By Assumption 1 2 . 2 1 the matrix H{u) is nonsingular. So there exist Aj and 
G J) satisfying (j3.2p . Define = for j J, then we also have u € K^- 

Second, we prove C W . Choose an arbitrary u G Kx with (n, A, /i) G J^. Let 
/ = {j g [m2] : c/j(n) = 0}. If / = 0, then ^ = 0, and [V/('u) Vh{u)\ and ^■^('u) are 
both singular, which implies all ^pi{u) = and u G W. If / 7^ 0, write / = {zi, . . . ,it}. Let 
J = {ji, . . . , j;^.} C [777-2] be an arbitrary index set with ?77i + k <m. 
Case I ^ J At least one j G J'^ belongs to /. By choice of /, we know from (12. 4p 

^i{u) = r^,{B\u))- fjg,(n) = 0. 

Case I J Then fij = for all j G J'^. By definition of K, the matrix B"'(u) must be 
singular. All polynomials ^{{x) vanish at u by their construction. 

Combining the above two cases, we know all ^{{x) vanish at u, that is, ipi{u) = ■ ■ ■ = 

ipr{u) =0. So U£W. □ 

Lemma 3.2. Suppose Assumption \2.SM s true. Let W he defined in \2. 6\) . and T = {x G M" : 
djix) > 0,j = 1, • • • , 777,2} • Then there exist disjoint subvarieties Wq, Wi, . . . , Wr of W and 
distinct fi, . . . , 7;^ G M such that 

w = WoUWiU---uWr, VFonr = 0, vf, nr/0,i = i,...,r, 

and f{x) is constantly equal to Vi on Wi for i = 1, . . . , r. 

Proof. Let K = Ki U • • • U be a decomposition of irreducible varieties. Then /(x) is 
equaling a constant Vi on each Ki, as shown by Lemma 3.3 in [7|. By grouping all Ki for 
which Vi are same into a single variety, we can assume all Vi are distinct. Let Wi be the 
projection of Ki into x-space, then by Lemma l3. II we get 

w = Wi[j---yjWr. 

Let Wi = ZariWi). Applying Zariski closure in the above gives 

W = Zar{W) = VFi U • • • U W^. 

Note that f{x) still achieves a constant value on each Wi. Group all Wj for which WjCiT = 
into a single variety Wo (if every Wj n T 7^ we set Wq = 0). For convenience, we still write 
the resulting decomposition as W = WqUWiL) ■ ■ ■ UWr. Clearly, Wq n T = 0, and for 7 > 
the values Vi are real and distinct (because 7^ Wi CiT C M" and f{x) has real coefficients). 
Since f{x) achieves distinct values on different Wi, we know Wi must be disjoint from each 
other. Therefore, we get a desired decomposition for W. □ 
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Lemma 3.3. Let /o,/i, ■ ■ ■ ,Ik be ideals o/M[x] such that V{Ii) R V{Ij) = for distinct i,j, 
and / = /q n /i n • • • n /fc. Then there exist qq, ai, . . . , ak € ^[x] satisfying 

al + -- - +al-l € I, ai e f] Ij. 

ij^je{o,...,k} 

Proof. We prove by induction. When A; = 1, by Theorem \A.2\ there exist p £ Io,q £ Ii such 
that p + q = 1. Then = p,ai = q satisfy the lemma. 

Suppose the lemma is true for k = t. We prove it is also true for k = t + 1. Let 
J = Iq n • • • n It- By induction, there exist bo, . . . ,bt € M[x] such that 

bl + --- + bf -1 e J, bi e Pi Ij, i = 0,...,t. 

i^je{o,...,t} 

Since V{It+i) is disjoint from V{J) = V{Io)U-- •uy(/t), by Theorem IX2l there exist p € h+i 
and q £ J such that p + q = 1. Let = bip for i = 0, . . . , t and at+i = q. Then 

ai£ f] Ij, i = 0,...,t + l. 

i^j&{0,...,t+l} 

Since {p + q)'^ = 1,1 = It+i D J, we have pq G I, (6q + • • • + 6^ - G /, and 

oo + a? + • • • + - 1 = (6o + • • • + ^? - " 2pg G ^, 

which completes the proof. □ 

Theorem 3.4. Suppose Assumption \2.2\ is true and let f* be the minimum of \2. 7\ ). Then 
there exists an integer N* > such that for every e > 

/(x)-r + e G /(^*) + p(^*). (3.3) 

Proof. Generally, we can assume /* = 0. Decompose W as in Lemma |3.2[ Then 

/* = min{z;i, . . . ,-17^}. 

Reorder Wi such that vi > V2 > • • • > Vr = 0. The ideal 

Iw = {hi,. . . ,hmi,^l, - ■ ■ ,V>r) (3.4) 

has a primary decomposition (see Sturmfels [30l Chapter 5]) 

Iw = Eo n El n ■ ■ ■ n Er 

such that each ideal Ei C M[x] has variety Wi = V{Ei). 

When i = 0, we have V^{Eq) n T = (T is defined in Lemma l3.2p . By Theorem IA.3( 
there exist SOS polynomials Ti, satisfying 

-1= ^ 'Ty-gy{x) mod Eq. 
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Thus, from / = i(/ + 1)2 - i(/ - 1)2, we have 



!/g{0,l}™2 

Tu ■ gu mod Eq 



!^G{0,1}™2 

for certain SOS polynomials tv- Let 

i/e{o,i}'"2 

Clearly, if A/'q > is big enoug h, then ao £ P^^") for ah e > 0. Let qo = f + e - ao G £^o, 
which is independent of e. 

For each i = 1, . . . ,r — 1, Vi > and v~^f{x) — 1 vanishes on Wi. By Theorem lA.il there 
exists ki > such that {v^^f{x) - 1)''^ G Ei. Thus, it holds that 



Si{x) := VW(l + Kri/(x)-l))'^' = V^E mod i?, 



Let (Tj = Si(x)2 + e, and qi = f + e — ai G -Bj, which is also independent of e > 0. 

When i = r, Vr = and /(x) vanishes on Wr- By Theorem lA.H there exists kr > such 
that fix)''- G Thus we obtain that 

k — 1 

s,(x) := V^(l + e-V(2;))'^' = ^/^ (^^^)e~'fixy mod Er . 

Let ar = Sr(x)2, and qr = f + e — ar G -E^- Clearly, we have 

qrix) = f^c,{e)f{x)'-+^ 
j=0 

for some real scalars Cj{e). Note each /(x)^'"''-' G 

Applying Lemma [331 to ideals Eq, Ei, . . . , Er, we can find oq, • • • , G M[x] satisfying 
ao H h - 1 G /vF, G Q -E^. 

i7^ie{0,l,...,r} 

Let a = aoOQ + cJiaf + • • • + Ura^, then 

r 

/(3;) + e-a = + e - ai)aj + (/ + e)(l - a^). 

1=0 
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Since qi = f + e — ai € Ei, it holds that 

r 

(/ + e - (Ti)af G r\Ej = Iw 

j=0 

For each < i < r, qi is independent of e. There exists A^i > such that for ah e > 

{f + e-ai)aj G /(^i), i = 0, 1, . . . , r - 1. 
For i = r,qr = f + € — ar depends on e. By the choice of qr, it holds that 

j=o 

Note each f^-'^^al G since g E'^,. So, there exists iV2 > such that for all e > 

U + e-ar)al G /(^^). 

Since 1 — af — • • • — G Iw ; there also exists > Q such that for all e > 

{f + e){l-al al) G I^^^). 

Combining the above, we know if A^* is big enoug h, then f{x) + e-a G I^^*) for ah 
e > 0. From the constructions of (Ji and Oj, we know their degrees are independent of e. So, 
a G for all e > if A^* is big enough, which completes the proof. □ 

Theorem l3.4l is a kind of Positivstellensatz of representing /(x) — /mm+fi) which is positive 
on S for all e > 0, by the preordering generated by gj modulo the ideal Iw in P-4p of variety 
W . Usually, we can not conclude f{x) — fmin G I^^*^ + P^-^*) by setting e = 0, because the 
coefficients of the representing polynomials of f{x) — fmin + ^ in I^^*^ go to infinity as 

e — )• (see Sr{x) in the proof). It is possible that /(x) — fmin l'^^'^ + P*^^) for every > 0. 
Such a counterexample is Example 15.11 However, Theorem 13.41 shows that the degree bound 
N* required for representing /(x) — fmin + e is independent of e. This is a crucial property 
justifying the exactness of the SDP relaxation (j2.8p . Now we present its proof below. 

Proof of Theorem \2.3\ By Theorem 13. 4^ there exists N* such that for every e > 

/(x)-(r-e) G 

Since f^} , f^l are the optimal values of ()2.8p and ()2.1ip respectively, we know 

r-e< A'i< A'i<r. 

Because e > is arbitrary, the above implies f^l = f^l = f*. Since the sequence {/}^^} is 
monotonically increasing and every < < /* by ()2.12p . we get = = /* for 
all N > N* . If the minimum fmin of (ll.ip is achievable, then there exists x* £ S such that 
fmin = f{x*). By Assumption 12. 2| we must have x* G W. So x* is feasible for (j2.7p . and 
/* = fmin- Thus, we also have /^^^ = /^^^ = fmin for all A^ > A^*. □ 
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Last we prove Theorem 12.51 by using the properties of resultants and discriminants de- 
scribed in Appendix. 

Proof of Theorem \2. 5\ (a) If Res{hi-^, . . . , hi^^^) ^ 0, then the polynomial system 

K{x) = ■■■ = /ii„+i(x) = 

does not have complex solution. Hence, V{h) = and consequently 5 = 0. 

(b) For a contradiction, suppose n — mi + 1 of gj vanish at u £ S, say, gj^^, . . . ,gj„_^ 
Then the polynomial system 

hi{x) = -- - = hmAx) = 9hix) = ■■■ = = 

has a solution, which contradicts Res{hi, . . . , hmi,gji, ■ ■ ■ ,gj„-mj^+i) / 0- 

(c) For every J = {ji, ■ ■ ■ ,jk} C [012] with k < n — mi, if 

A(^i, . . . ,hmj_,gjj_,- ■ ■,gjj + 0, 

then the polynomial system 

h^{x) = ■■■ = hm.^ (x) = gj^ (x) = • • • = gj^ (x) = 

has no singular solution, i.e., the variety V{h,gj) is smooth. 

(d) Let fo{x) = f{x) — fmin- Then /o lies on the boundary of the set 

Pd{S) = G Bd{S) : p{x) > OVx G s}. 

Since S is closed at 00, by Prop. 6.1 of [20], /o G dP(i{S) implies 

= min fo{x). 

xG5P'"M|x||2=l,a;o>0 

Let u = {uq,ui, . . . ,Un) 7^ be a minimizer of the above, which must exist because the feasible 
set is compact. We claim that uq ^ 0. Otherwise, suppose uq = 0. Then u = (ui, . . . , n„) / 
is a minimizer of 

= min f''°"'{x) 

s.t. hf"^{x) = ■■■ = h^^^{x) = 0, 
g1""\x)>0,...,g';,"^{x)>0. 

Let ji, ■ ■ ■ ,jk G [^-2] be the indices of active constraints. By Fritz- John optimality condition 
(see Sec. 3.3.5 in [l]), there exists (Aq, Ai, . . . , Am^,/ii, . . . , fik) / satisfying 

mi k 

AoV/'^°-(n) + X:A..V/i,'^°-(^x) + • • • + E/^^V(7j';-(tx) = 0, 

i=l 1=1 

fhom^^^ = h^^\u) = ■■■ = h^^^iu) = g^°"\u) = ■■■ = g^^'^iu) = 0. 
Thus, the homogeneous polynomial system 

f^°-\x) = hi""\x) = ■■■ = = g^ri^) = ■■■= 9jr(^) = 

has a nonzero singular solution. Since the resultant of any n of fihom^ghom nonzero, we 
must have irii + k < n — 1. So the discriminant 

AI fhom T^hom r^hom „ hom\ 

U jil-l n-mi ) fi'ii ) • • • ) ffjfe j 

is defined and must vanish, which is a contradiction. So uq 7^ 0. Let v = u/uq, then u G S^^^ 
implies v £ S and f{v) - fmin = '^/o('") = 0. 

Clearly, (e) is true since it is a special case of (d). □ 
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4 Some variations 

This section presents some variations of the exact SDP relaxation ()2.8p and its dual ()2.1ip . 



4.1 A refined version based on all maximal minors 

An SDP relaxation tighter than (j2.8p would be obtained by using all the maximal minors 
to define the determinantal variety Gj in (j2.2p . while the number of equations would be 
significantly larger. For every J = {ji, . . . , j^} C [m2\ with mi + k < m, let 

Ti ,. . . 

be all the maximal minors of B'^ix) defined in ([221). Then define new polynomials 

^^:f{x) = Tfix)■llgj{x), i = !,...,£. (4.1) 

Jl 



List all such possible ^/(x) as 



'ipi{x),il}2{x), ...,i^t{x), where t= V ( , , ^ 

J(Z[m2\,\J\<m~mi ^' ' 

Like p.7p . we formulate (jl.ip equivalently as 

min fix) 

s.t. hi{x) = ^j{x) = 0, i G [mi], j G [t], (4.2) 
>0, Vi/e {0,1^2. 

The standard A^-th order Lasserre's relaxation for the above is 
min Lf{y) 

s.t. 4f)(y) = 0,4^)(y)=0, (4.3) 
4f^(y)^0, VZ.G {0,1}™^, yo = 1. 

Note that every (pf in (]2.4p is a sum of polynomials like ip'/ix) in (14. ip . So the equations 
L[^,\y) = in ([33]) implies L^^\y) = in ([231). Hence, (gSl) is stronger than Its 
dual is an SOS program like (|2.1ip . Theorem 12.31 then implies the following. 

Corollary 4.1. Suppose Assumption \2.2\ is true, and the minimum fmin of is achiev- 

able. If N is big enough, then the optimal value of ^.3\) is equal to fmin- 



4.2 A Lasserre type variation without using cross products of gj 

If the minimum fmin of (jl.ip is achieved at a KKT point, then (jl.ip is equivalent to 



min fix) 

S.t. hi{x) = ipj{x) = 0, i e [mi], J G [r], (4-4) 
9i{x) > 0, . . . ,gm2{x) > 0. 
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The standard A^-th order Lasserre's relaxation for (|4.4p is 
min Lf{y) 

s.t. 4f)(2/)=0,LiJ^(y)=0,iGM,iG[r], (4.5) 
L^I^Hy) ^ 0, i = 0, 1,. . . ,7712, yo = 1- 

The difference between (j4.5p and (j2.8p is that the cross products of gj{x) are not used in 
(j4.5p . which makes the number of resulting LMIs much smaller. Similar to P^^\ define the 
truncated quadratic module M^^^ generated by gi as 

M(^) = |f^tTi(x)5i(x) : degiaiQi) < 2iv| . (4.6) 

The dual of (j4.5p would be shown to be the following SOS relaxation for (j4.4p : 

max 7 

s.t. /(^)_^e/W+MW. ^^-^^ 

Clearly, for the same A^, (j4.7p is stronger than the standard Lasserre's relaxation (jl.2p . To 
prove ()4.5p and (14. 7p are exact for some A'^, we need the archimedean condition (AC) for S, 
i.e., there exist > 0, 4>i{x), . . . ,(pmiix) S M[x] and SOS so{x), . . . ,Sm2{x) G M.[x] such that 

mi 1712 

= '^cl)i{x)hi{x) + '^Sj{x)gj{x). 

i=l j=0 

Theorem 4.2. Suppose Assumption \2.S\ and the archimedean condition hold. If N is big 
enough, then the optimal values of |^.5[ ) a7ic? ?[ ) are equal to fmin- 

To prove Theorem \4:.2\ we need the following. 

Theorem 4.3. Suppose Assumption \2.^ and the archimedean condition hold. Let f* be the 
optimal value of (f^.^[ ). Then there exists an integer N* > such that for every e > 

f{x) - /* + e E + M(^*). (4.8) 

Proof. The proof is almost same as for Theorem 13.41 We follow the same approach used 
there. The only difference occurs for the case i = and V^{Eq) n T = 0. By Theorem I A. 31 
there exist SOS polynomials 7]^, satisfying 

-2= ^ rj^- g,. mod Eq. 

i/e{o,i}™2 

Clearly, each + rjy ■ g'[^ ■ ■ ■ g'^^ is positive on S. Since AC holds, by Putinar's Positivtel- 
lensatz (Theorem IA.4p . there exist SOS polynomials Oj^i such that 



2^^'^"' ^"^^ ^'^'^^^ (/7l, . . . , /imi; 



i=0 
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Hence, it holds that 



-1= J2 {^^'^"'sA {hi,. . . ,hmi) + Eo 

i/G{0,l}'"2 ^ ^ 

= ^ ^ Ou,i gi mod £'0. 

1=0 \u(^{0,l}"^2 J 

The second equivalence above is due to the relation 

{hi, . . . ,hmi) C /vK C £"0- 
Letting Tj = Yli^e{o i}™2 (^u,i, which is clearly SOS, we get 

-1 = To + ri5ti H h Tm.2gm2 mod Eq. 

The rest of the proof is almost same as for Theorem 13.41 □ 

Proof of Theorem \4-2\ For convenience, still let fi^\ fj^^ be the optimal values of (j4.5p and 
(|4.7p respectively. From Theorem 14.31 there exists an integer A^* such that for all e > 

/(x)-(r-e) G /(^*) + m(^*). 

Like in the proof of Theorem 12.31 we can similarly prove f^^ = f^^ = f* for all N > N*. 
Since AC holds, the set S must be compact. So the minimum fmin of (jl.ip must be achievable. 
By Assumption 12.21 we know /* = fmin, and the proof is complete. □ 

4.3 A simplified version for inactive constraints 

Suppose in (jl.ip we are only interested in a minimizer making all the inequality constraints 
inactive. Consider the problem 

min f(x) 

S.t. hi{x) = ---= hra^ {x) = 0, (4-9) 

gi{x) > 0,...,gm2{x) > 0. 
Let M be a minimizer of (j4.9p . If V{h) is smooth at u, there exist Aj such that 

Vf{u) = XiVhiiu) + ■■■ + Xm^VhmM- 
Thus, u belongs to the determinantal variety 

G/, = {x : rank [V/(x) V/i(x)] < mi} . 

If mi < n, let i;^!, . . . , 0s be a minimum set of defining polynomials for Gh by using formula 
(j2.3p . If mi = n, then Gh = and we do not need these polynomials; set s = 0, and [s] is 
empty. Then, ()4.9p is equivalent to 

min fix) 

s.t. hi{x) = 0,(/>j(x) = 0, i e [mi],j E [s], (4-10) 
gi{x) > 0, . . . , (2;) > 0- 
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The difference between (j4.10p and (j2.7p is that the number of new equations in (j4.10p is 
s = o(nmi^, which is much smaller than r in (j2.7p . So, (|4.10p is preferable to (j2.7p when 
the inequality constraints are all inactive. The A^-th order Lasserre's relaxation for (|4.1Up is 

min Lf{y) 

s.t. 4f)(y)=0,4^)(y)=0,iGM,jG[s], (4.11) 

L^g^\y) ^ 0, i = 1, . . . ,7712, yo = 1- 

A tighter version than the above using cross products of gj is 
min Lf{y) 

s.t. 4f)(7/) = 0,4^)(y) = 0,iGM,iG[s], (4.12) 
4^\y)hO, V7/G{0,l}'"^yo = l. 

Define the truncated ideal J^^^ generated by hi{x) and <j)j as 



The dual of (j4.1ip is the SOS relaxation 



max 7 
s.t. /(x) -7 G +M(^). 

The dual of (j4.12p is the SOS relaxation 

max 7 
s.t. /(x) -7 G 

The exactness of the above relaxations is summarized as follows. 



(4.13) 



(4.14) 



Theorem 4.4. Suppose the variety V{h) is nonsingular and the minimum fmin of Iji4-^ is 
achieved at some feasible u with every gj{u) > 0. If N is big enough, then the optimal values 
of ^4-1^ o^f^d ^4-14^ ti'^e equal to fmin- If iiT' addition, the archimedean condition holds for 
S, the optimal values of and (^TT^ are also equal to fmin for N big enough. 

Proof. The proof is almost same as for Theorems 12.31 and 14.21 We can first prove a decom- 
position result like Lemma 13.21 and then prove there exists A^* > such that for all e > 
(like in Theorem 13. 4p 

f{x)-fm^n + e e J(^*)+P(^*). 

Furthermore, if AC holds, we can similarly prove there exists N* > such that for all e > 
(like in Theorem. 14. 3p 

f{x)-fm^n + e G J(^*)+M(^*). 

The rest of the proof is almost same as for Theorems 12.31 and 14.21 Due to its repeating, we 
omit the details here for the cleanness of the paper. □ 
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5 Examples 



This section presents some examples on how to apply the SDP relaxation p.Sp and its dual 
(j2.1ip to solve polynomial optimization problems. The software GloptiPoly 3 [H] is used to 
solve dlSl) and ^TT\\\ . 

First, we consider an unconstrained optimization. 

Example 5.1. Consider problem 

Q o Q AO OA. (\ 

"YY^ 1 T1 ry*^ I /y* ^ I ry* ^ I ry* rv* I ry* ry* I ry* < ry* ry* ry*^ 



This example was studied in [18]. Its global minimum is zero. We apply SDP relaxation 
(|2.8p of order = 4, and get a lower bound —9.7 x 10~^. The minimizer (0,0) is extracted. 
In [18j . it was shown that f{x) is not SOS modulo its gradient ideal Igrad- But for every 
e > 0, /(x) + e = modulo Igrad for some SOS whose degree is independent of e 

(see equation (10) of [IE])- But its coefficients go to infinity as e — t- 0. This shows that the 
optimal value of (j2.1ip might not be achievable. □ 

Second, we consider polynomial optimization having only equality constraints. 

min fix) s.t. hAx) = ■ ■ ■ = hmix) = Q. (5.1) 

When V{h) is nonsingular, its equivalent version (12. 7p reduces to 



min fix) 
s.t. hi{x) = ■ ■ ■ = hm{x) = 



Z detiF{x)=0, (-+2),...,(n-f)(m + l) 



(5.2) 



Ie[n]m+i 
sum{I)=£ 



In the above sum{I) denotes the summation of the indices in /, Fi^x) = [V/(rE) V/i(x)], 
and det/ F[x) denotes the maximal minor of F[x) whose row indices are in /. When m > n, 
there are no minor equations in (j5.2p . 



Example 5.2. Consider the optimization 

min Xi + X2 + x^ + 'ix\x^xi — 2>{x\ix\ + x\) + x^i^xt + x\) + xi(^x\ + X2)) 
S.t. xi + X2 + a;3 — 1 = 0. 

The objective is the Robinson polynomial, which is nonnegative everywhere but not SOS |24j . 
So the minimum fmin = 0. We apply SDP relaxation (j2.8p of order A'' = 4, and get a lower 
bound -4.4600 x 10"^. The minimizer (1/3, 1/3, 1/3) is also extracted. Applying Lasserre's 
relaxation (II. 2p of orders = 3, 4, 5, 6, 7, we get lower bounds respectively 

-0.0582, -0.0479, -0.0194, -0.0053, -4.8358 x 10"^ 



We can see that (jl.2p is weaker than ()2.8p . It is not clear whether the sequence of relaxations 
(|1.2p converges or not for this problem, since the feasible set is noncompact. But, the f{x) 
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here is not SOS modulo the constraint in this example. Otherwise, suppose there exist 
polynomials cr(x) being SOS and (j){x) such that 

f{x) = a{x) + (l){x){xi + X2 + X3 - 1). 

In the above, replacing every Xi by Xi/{xi + X2 + x^) gives 

f{x) = (xi + X2 + X3fa{x/{xi +X2 + X3)). 

So, there exist polynomials pi, ■ ■ ■ ,Pk, Qi, ■ ■ ■ ,Qe such that 

2 2 

f{x)=pl + ---+pl + - T2+--- + 



(Xi+X2 + X3)2 ' ' (xi+X2 + X3)2^ 

Since the objective f{x) does not have any pole, every qi must vanish whenever X1 + X2 + X3 = 
0. Thus Qi = {xi + X2 + x-^Ywi for some polynomials Wi. Hence, we get 

fix)=pi + ---+p'^. + wl + --- + wj 

is SOS, which is a contradiction. □ 

Third, consider polynomial optimization having only a single inequality constraint. 

min f(x) s.t. g(x) > 0. (5.3) 
Its equivalent form (12. 7p becomes 
min fix) 

s-t- =0, i = 1, . . . , n, 5r(x) > 0, (5.4) 

^ /a|M^_^^)=0, £ = 3,...,2n-l. 

' V "Xi axj axj axi / ' ' ' 

i+j=l ^ J J y 

There are totally 3(n — 1) equalities and a single inequality. 
Example 5.3. Consider the optimization 

42,24,6 q222 
mm X-|^X2 ~r X-^OC^^ ~t~ oOC -^oc^^x^ 

s.t. x^ + x| + x| < 1. 

The objective is the Motzkin polynomial which is nonnegative everywhere but not SOS [24j . 
So its minimum is 0. We apply SDP relaxation (j2.8p of order = 4, and get a lower bound 
— 1.6948 X 10~^. The minimizer (0, 0, 0) is also extracted. Now we apply Lasserre's relaxation 
p.2p . For orders = 4, 5, 6, 7, 8, (jl.2p returns the lower bounds respectively 



-2.0331 X 10"^, -2.9222 x 10~^ -8.2600 x 10"^ -4.2565 x 10"^ -2.3465 x 10"^ 

We can see that (jl.2p is weaker than (|2.8p . The sequence of (|1.2p certainly converges since the 
feasible set is compact. However, the objective does not belong to the preordering generated 
by the ball condition. This fact was kindly pointed out to the author by Glaus Scheiderer 
(implied by his proof of Prop. 6.1 in |25], since the objective is a nonnegative but non-SOS 
form vanishing at origin). □ 
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Example 5.4. Consider Example 15.31 but the constraint is the exterior of the ball: 



42, 24| 6 q222 
IXllIl X-^Xq^ ~i~ X'^X'2 ~i "-^3 ^jX-^Xt^X'^ 



s.t. x\ + x\ + x\> 1. 



Its minimum is still 0. We apply SDP relaxation (j2.8p of order = 4, and get a lower bound 
1.7633 X 10~^ (its sign is not correct due to numerical issues). Now we compare it with 
Lasserre's relaxation ([L2]). When iV = 4, ([O]) is not feasible. When N = 5,6,7,8, (fL2|) 
returns the following lower bounds respectively 

-4.8567 X 10^ -98.4862, -0.7079, -0.0277. 

So we can see (II. 2p is much weaker than (12. 8p . It is not clear whether (11.2p converges or not 
for this problem, since its feasible set is unbounded. □ 

Last, we show some general examples. 

Example 5.5. Consider the following polynomial optimization 

min X? + 

s.t. x^ - 1 > 0, 

xl - MxiX2 - 1 > 0, 
xl + MxiX2 - 1 > 0. 

This problem was studied in (H [E]. Its global minimum is 2 + ^M{M + ^/M^T4:). Let 
M = 5 here. Applying (j2.8p of order = 4, we get a lower bound 27.9629 which equals 
the global minimum, and four global minimizers (±5.1926, ±1.0000). However, if we apply 
the Lasserre's relaxation either ()1.2p or ()1.3p . the best lower bound we would obtain is 2, no 
matter how big the relaxation order is (see Example 4.5 of [7]). □ 

Example 5.6. Consider the polynomial optimization 

111111 ju-^ju<2^ ~t~ a^2'*'3 ~r JiJ'^JiJ'^ \jJu'^Ju(2^Jl/'\ 

s.t. 1 - xf > 0, 1 - 2;2 > 0, 1 - x| > 0. 

The objective is a nonnegative form being non-SOS |24l Sec. 4c]. Thus its minimum is 0. 
We apply SDP relaxation (j2.8p of order = 6, and get a lower bound —9.0752 x 10~^. A 
minimizer (0,0,0) is also extracted. Now we apply Lasserre's relaxation of type (jl.3p . For 
A^ = 6, 7, 8, (|1.3p returns the lower bounds respectively 

-3.5619 X 10"^ -1.0406 x 10"^ -7.6934 x 10"^. 

So we can see that (jl.3p is weaker than (|2.8p . They converge in this case, since the feasible 
set is compact. However, the objective does not belong to the preordering generated by the 
constraints, which is implied by the proof of Prop. 6.1 of [25] (the objective is a nonnegative 
but non-SOS form vanishing at origin). □ 
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6 Some conclusions and discussions 



This paper proposes the exact SDP relaxation (12. 8p and its dual ()2.1ip for polynomial op- 
timization (|l.ip by using the Jacobian of its defining polynomials. Under some generic 
conditions, we showed that the minimum of (jl.ip would be found by solving the SDP (12. Sp 
for a finite relaxation order. 

The results of this paper improve the earlier work [3 [18], where the exactness of gradient 
or KKT type SOS relaxations for a finite relaxation order is only proved when the gradient 
or KKT ideal is radical. There are other conditions like boundary hessian condition (BHC) 
guaranteeing this property, like in [15\ [T7] . In [T7] , Marshall showed that the gradient SOS 
relaxation is also exact for a finite relaxation order by assuming BHC, in unconstrained 
optimization. In [15], Hiep proposed a KKT type SOS relaxation using critical variety for 
constrained optimization, and its exactness for a finite relaxation order is also presented 
under BHC. In this paper, the exactness of (12. 8p and (12. lip for a finite is proved without 
the conditions like radicalness or BHC. The only assumptions required are nonsingularity of 
S and the minimum fmin being achievable (the earlier related work also requires this), but 
they are generically true as shown by Theorem 12.51 

We would like to point out that the KKT type SOS relaxation proposed in ^ using 
Lagrange multipliers is also exact for a finite order, no matter the KKT ideal is radical or 
not. This would be proved in a similar way as we did in Section [31 First, we can get a 
similar decomposition for the KKT variety like Lemma 13.21 Second, we can prove a similar 
representation for f{x) — /* + e like in Theorem 13.41 with degree bounds independent of e. 
Based on these two steps, we can similarly prove its exactness for a finite relaxation order. 
Since the proof is almost a repeating of Section [3l we omit it for the cleanness of the paper. 

The proof of the exactness of (12. Sp provides a representation of polynomials that are posi- 
tive on S through using the preordering of S and the Jacobian of all the involved polynomials. 
A nice property of this representation is that the degrees of the representing polynomials are 
independent of the minimum value. This is presented by Theorem 13. 4[ A similar representa- 
tion result using the quadratic module of S is given by Theorem 14.31 

An issue that is not addressed by the paper is that the feasible set S has singularities. 
If a global minimizer x* of (II. ip is singular on S, then the KKT condition might no longer 
hold, and x* W. In this case, the original optimization (jl.ip is not equivalent to (j2.7p . and 
the SDP relaxation (12. Sp might not give a correct lower bound for fmin- It is not clear how 
to handle singularities generally in an efficient way. 

Another issue that is not addressed by the paper is the minimum fmin of (|l.ip is not 
achievable, which happens only if S is noncompact. For instance, when S" = M^, the poly- 
nomial xf + {xiX2 — 1)^ has minimum but it is not achievable. If applying the relaxation 
()2.8p for this instance, we would not get a correct lower bound. Generally, this case will not 
happen, as shown by items (d), (e) of Theorem 12.51 In unconstrained optimization, when 
fmin is not achievable, excellent approaches are proposed in [9l[ini[28]. It is an interesting 
future work to generalize them to constrained optimization. 

An important question is for what concrete relaxation order A^* the SDP relaxation (12. Sp 
is exact for solving (|l.ip . No good estimates for N* in Theorem 12.31 are available currently. 
Since the original problem (II. ip is NP-hard, any such estimates would be very bad if they 
exist. This is another interesting future work. 
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A Some basics in algebraic geometry and real algebra 

In this appendix, we give a short review on basic algebraic geometry and real algebra. More 
details would be found in the books [U lllj . 

An ideal / of M.[x] is a subset such that I ■ M.[x] C /. Given polynomials pi, . . . ,pm G 
{pi, - • • ,Pm) denotes the smallest ideal containing every pi, which is the set piM[x] + • • • + 
Pml^i^;]. The ideals in C[x] are defined similarly. An algebraic variety is a subset of C" that 
are common complex zeros of polynomials in an ideal. Let / be an ideal of M[x]. Define 

V{I) = {x G C" : p{x) =0 Vp G /}, 
Vr{I) = {x G M" : p{x) =0 Vp G /}. 

The V{I) is called an algebraic variety or just a variety, and is called a real algebraic 

variety or just a real variety. Every subset T C C" is contained in a variety in C". The 
smallest one containing T is called the Zariski closure of S, and is denoted by Zar{T). In the 
Zariski topology on C^, the varieties are called closed sets, and the complements of varieties 
are called open sets. A variety V is irreducible if there exist no proper subvarieties Vi, V2 of 
V such that V = ViUV2- Every variety is a finite union of irreducible varieties. 

Theorem A.l (Hilbert's Strong NuUstellensatz). Let I C M[x] be an ideal. If p & M[x] 
vanishes on V{I), then p^ ^ I for some integer k > 0. 

If an ideal / has empty variety V{I), then 1 G /. This is precisely the Hilbert's weak 
NuUstellensatz. 

Theorem A.2 (Hilbert's Weak NuUstellensatz). Let I C M[x] be an ideal. IfV{I) = 0, then 
1 G /. 

Now we consider / to be an ideal generated by polynomials having real coefficients. Let 
T be a basic closed semialgebraic set. There is a certificate for V^{I) Ci T = . This is the 
so-called Positivstellensatz. 

Theorem A. 3 (Positivstellensatz, [29]). Let I C M[x] be an ideal, and T = {x G M" : 

gi{x) > 0, . . . ,gr{x) > 0} be defined by real polynomials gi. //Vk(/) PlT = 0, then there exist 
SOS polynomials CTy such that 

— 1 = ^ <^u ■ 9]^ • ■ • dr"^ mod I. 

Theorem A. 4 (Putinar's Positivstellensatz, j23j). Let I be an ideal o/M[x] and T = {x G 
: gi{x) > 0, . . . ,gr{x) > 0} be defined by real polynomials gi. Suppose there exist R > 
and SOS polynomials S(){x), . . . , Sm{x) such that (the archimedean condition holds) 

R- \\x\\l = so{x) + si{x)gi{x) -\ \- Sm{x)gmix) mod I. 

If a polynomial /(x) is positive on Vm.{I) H T, then there exist SOS polynomials ai such that 

f{x) = fTo(x) + ai{x)gi{x) ^ h CFm{x)gni{x) mod I. 
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In the following, we review some elementary background about resultants and discrimi- 
nants. More details would be found in [5l EJ [30] . 

Let /i, . . . , /„ be homogeneous polynomials in x = (xi, . . . , The resultant Res{fi, . . . , /„) 
is a polynomial in the coefficients of /i, . . . , /„ satisfying 

i?es(/i,...,/n) = ^ /i(^z) = --- = /„(n) = 0. 

The resultant i?es(/i, . . . , fn) is homogeneous, irreducible and has integer coefficients. When 
f{x) is a single homogeneous polynomial, its discriminant is defined to be 

A(/) = Res{^,...,^). 

OXi OXn 

Thus, we have the relation 

A(/) = ^ V/(n) = 0. 

The discriminants and resultants are also defined for inhomogeneous polynomials. Let 
fo, fi, ■ ■ ■ , fn be general polynomials in x = (xi, . . . , Their resultant i?es(/o, /i, . . . , /„) 
is then defined to be Res{fo{x), fi{x), . . . ,fn{x)), where each fi{x) = Xq°^^'^^^ f [x / xq) is the 
homogenization of fi{x). Clearly, if the polynomial system 

Mx)=Mx) = --- = fn{x) = 

has a solution in C", then the homogeneous system 

Mx) = fi{i) = ■ ■ ■ = fn{x) = 

has a nozero solution in C""^^, and hence Res{fo, fi, . . . , fn) = 0. The reverse is not always 
true, because the latter homogeneous system might have a solution at infinity xq = 0. If f{x) 
is a single nonhomogeneous polynomial, its discriminant is defined similarly as A(/). 

The discriminants are also defined for several polynomials. More details are in |20l Sec. 3]. 
Let . . . , be forms in x = (xi,...,x„) of degrees di,...,dm respectively, and 

m < n — 1. Suppose at least one di > 1. The discriminant for fi, . . . , fm, denoted by 
A(/i, . . . , fm), is a polynomial in the coefficients of fi such that 

A(/i,...,/^) = 

if and only if the polynomial system 

fl{x) = ■ ■ ■ = fm{x) = 

has a solution u ^ such that the matrix [V/i(ti) • • • V/m,(u)] does not have full rank. 
When m = 1, A(/i, . . . , fm) reduces to the standard discriminant of a single polynomial. 

When /i, . . . , /m are nonhomogeneous polynomials in x = (xi, . . . , x„) and m < n, the 
discriminant A(/i, . . . , fm) is then defined to be A(/i(x), . . . , /m(x)), where each fi{x) is the 
homogenization of fi{x). 
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